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Abstract 

A recent generalization of Gerstenhaber's theorem on spaces of nilpo- 
tent matrices is derived, under mild conditions on the cardinality of the 
underlying field, from Atkinson's structure theorem on primitive spaces of 
bounded rank matrices. 
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1 Introduction 

In the modern geometric theory of matrices, two deep structure theorems stand 
out: Dieudonne's theorem [3] on large spaces of singular matrices - later gen- 
eralized by Atkinson and Lloyd [3] - and Gerstenhaber's theorem on spaces of 
nilpotent matrices [5J. Very recent advances have been made in both problems: 
Atkinson and Lloyd's extension of Dieudonne's theorem has been shown to hold 
for almost all fields [10], while there have been several generalizations of Ger- 
stenhaber's theorem, most notably to trivial spectrum spaces of matrices, 
i.e., subspaces of square matrices having no non-zero eigenvalue in their field of 
definition. 
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In our view, another structure theorem has not received the attention it de- 
served from the mathematics community: it is Atkinson's theorem on primitive 
spaces of bounded rank matrices [2j Theorem B]. In this note, we will expose a 
yet unknown connection between Atkinson's theorem and Gerstenhaber's, and 
will use this insight to obtain a greatly simplified proof of a slightly weaker ver- 
sion of the classification theorem for trivial spectrum spaces of matrices. We 
believe that the technique displayed here has a great potential to deliver new 
insights into the structure of spaces of nilpotent matrices, as it only uses a limit 
case in Atkinson's theorem. 



2 Notation and basic definition 

Here, K denotes an arbitrary field, and M min (K), M„(1C), A n (IfC), GL„(1K) 
respectively denote the sets of all m x n matrices, n x n matrices, n x n 
alternating matrices, and n x n invertible matrices with entries in !€. Two 
subsets V and W of M m n (IC) are called equivalent when there exists a pair 
(P,Q) € GL m (K) x GL n (K) such that V = PWQ, which means that V and 
W represent the same set of linear operators in different choices of bases of the 
source and goal spaces. If m = n, we say that V and W are similar when, in the 
above condition, we require that Q = P^ 1 , meaning that V and W represent the 
same set of endomorphisms of a vector space in two potentially different bases. 

Given a subset V of M m) „(K), the upper rank of V, denoted by urk(V), is 
the largest rank for a matrix in V. Note that two equivalent subsets share the 
same upper rank. 

A linear subspace V of M m) „(K) is called primitive when it satisfies the 
following four conditions: 

(i) V is not equivalent to a space of matrices with the last column equal to 
zero; 

(ii) V is not equivalent to a space of matrices with the last row equal to zero; 

(iii) V is not equivalent to a space V' in which every matrix is written as M = 
[H(M) [?] mxl ] and urktf(V') < urkV; 



(iv) V is not equivalent to a space V' in which every matrix is written as M 
~R(M)~ 

[?]lxn 



and urk_R(V") < urkV. 
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Moreover, we say that V is semi-primitive when it is only required to satisfy 
conditions (i), (ii) and (iii); we say that V is reduced when it is only required 
to satisfy conditions (i) and (ii). 

We note that the upper rank of a semi-primitive subspace of M mjn (]K) is 
always less than n, so that the upper rank of a primitive subspace is always less 
than m and n. As shown by the first statement in Theorem 1 of [I], the primitive 
spaces are the elementary pieces upon which are built all those matrix spaces 
with upper rank less than the number of rows and the number of columns. 
A fundamental example of primitive space can be derived from the canonical 
pairing 



A . 



In the canonical basis (ei, . . . , e n ) of K™ and the lexicographically ordered basis 
(ej A ej)i<i<j<n of K n A K n , one then takes the space S(ip n ) of all matrices 
representing the operators x A — for x € K n . One checks that, for n > 2, S(ip n ) 
is a semi-primitive linear subspace of M/rn n (K) - and even a primitive one if 
n > 2 - and that it is also the space of all matrices 



' X T A 1 
X T A 



(2). 



with X e 



where the matrices A\, . . . are the elements of (eiej — &j^i)\<i<j<n put 

in the lexicographical order. A subspace V equivalent to S(ip n ) is exactly a 
subspace for which there is a basis (B\ } . . . , B^) of A n (K) together with some 

P G GL n (K) such that V is the space of all matrices 

X T B\P 



XTB ® P 



with X £ 
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3 Atkinson's theorem 



We now state a special case of the transposed version of Atkinson's theorem, 
combining Lemma 6 of pQ and Theorem B of [2]. 

Theorem 1 (Atkinson). Let S be a semi-primitive linear sub space o/M m>n (K). 
Set r := urk<S and assume that #K > r. Then, m < r ( r + 1 ) . jf { n addition 
m = r ( r + 1 ^ and r > 1, then n = r + 1 and S is equivalent to S((p n ). 

Note that, for the case m = r ^ r ^ and r > 1, Atkinson only states his result 
for primitive spaces: however, if we put Atkinson's arguments in the context 
of our version (i.e., we transpose them), then the only instance when he uses 
condition (iv) is to discard the case where S might be equivalent to a space S' 



of matrices of the form M 



[?]lxg [ ? ]lx(n-g) 



with q € [0,n - 1] and 

tl \M J [UJ( m _!) X ( n _g)_ 

a reduced space H(S'). But then urkH(S') < r — 1 and Lemma 6 of pQ would 
show that m - 1 < Q < f^ 1 ) - 1 

as, in the terminology of Atkinson and Lloyd, 
the space H(S') would have the column property. 

We also remark that the conclusion of the second statement still holds in the 
case m = 1 and n = 2, as then S = Mx^QK) = S{(f2)- 



4 The generalized Gerstenhaber theorem 

Seemingly unrelated to Atkinson's theory of primitive spaces is the following 
generalization of Gerstenhaber 's theorem, proved in [8j, in which a linear sub- 
space V of M n (K) is called irreducible if no proper non-zero linear subspace of 
K" is globally invariant under all the elements of V. 

Theorem 2 (Generalized Gerstenhaber theorem). Let V be a trivial spectrum 
linear subspace of M n (K) . Then, 

^v< n{n - l) - (1) 

If equality holds in ([1]) with #K > 3 and V irreducible, then V = PA n (I) for 
some P G GL n (K). 

Given P € GL n (K), one then shows that PA n (K) is an irreducible space 
with a trivial spectrum if and only if X t— } X T PX is a non-isotropic quadratic 
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form on ~K n , see [8j Lemma 10]. Theorem [2] may be used to provide a complete 
classification of trivial spectrum spaces with the maximal dimension - up to 
similarity - reducing the problem to the classification of non-isotropic bilinear 
forms [8J Theorem 4] up to similarity. 

The main rationale for studying trivial spectrum spaces stems from their ties 
to affine spaces of non-singular matrices. In short, if A is an affine subspace of 
non-singular matrices of M n (5C) which contains I n , then its translation vector 
space A — I n is a trivial spectrum linear subspace. With little effort, Theorem 
[2] shows that any such affine subspace has dimension less than or equal to (2) , 
and reduces the classification - up to equivalence - of those with the maximal 
dimension to the classification of non-isotropic quadratic forms on K, up to 
similarity [5J Theorem 7]. This has led to similar structure theorems for large 
affine spaces of matrices with a lower bound on the rank [TJ. 

Finally, Theorem [2] yields Gerstenhaber's theorem - which we recall now - as 
an easy consequence for fields with more than 2 elements (for proofs that hold 
regardless of the cardinality of the field, see [9] and [11], while an elegant proof 
that works only for fields with more than two elements is featured in [6]). 

Theorem 3 (Gerstenhaber's theorem [5]). Let V be a linear subspace o/M n (K) 
in which all the matrices are nilpotent. Then, dimV < ^\ and equality 
holds if and only ifV is similar to the space of all strictly upper-triangular nxn 
matrices. 

5 Proof of the generalized Gerstenhaber theorem in 
a restricted setting 

Spread over twenty dense pages, the only known proof of Theorem [2] is a very 
intricate tour de force. Using Atkinson's theorem on semi-primitive spaces, we 
shall now give a much shorter alternative proof under the additional assumption 
that #K>n. 

Let us immediately explain the connection with Atkinson's theorem. Let V 
be a trivial spectrum subspace of M n (K). Then, V has an interesting property 
that was the cornerstone of the proof of Theorem [2] featured in [8j: it is totally 
intransitive in the sense that, for every non-zero vector X £ K", the linear 
subspace VX := {NX \ N € V} is a proper subspace of K n as it cannot contain 
X. 

We now combine this simple fact with a duality argument to create a linear 
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subspace of M m>n (K) with upper rank less than n, where m : = dimV. For 
X G K n , consider the bilinear form 

X : (N,Y) G V x K n i — > y T iVX G K, 

and denote by Ai C M m)n (K) the space of all matrices representing such forms 
in chosen bases of V and K7\ In other words, if the chosen basis of V is 
(Bi, . . . , B m ), then there is an invertible matrix P G GL n (lK) such that, for 
all X G K n , the matrix of X is 

X T BfP 
X T B T m P 

Note that V always satisfies condition (ii) , as a matrix N G M n (IK) which satisfies 
y T iVX = for all (X, Y) G (W 1 ) 2 is necessarily zero. We also see that urk V < n 
since, given a non-zero vector X G K™, the set VX is a proper linear subspace 
of W 1 , which yields a non-zero vector Y G K n for which Y T VX = 0. 

Now, it may very well happen that M. is not semi-primitive, but let us assume 
for the moment that it is. Then, by Atkinson's theorem, we would have m < Q), 
and, in case of equality, we would find a basis (A±, . . . ,A m ) of A n (K) together 
with an invertible matrix Q G GL n (K) such that 



~X T BfP 




~X T A 1 Q~ 


_X T BlP_ 




_X T A m Q_ 



MX G 



this would yield Bj P = AiQ for all i G [l,m], and hence we would conclude 
that 

V = (QP-y A n (K). 



Let us now return to the general case. The proof works by induction on n. 
The result is trivial for n = 1 and we now assume that n > 2. Note first that 
we may always assume that V is irreducible, for if it is not, then, replacing V by 
a similar subspace, we can see that no generality is lost in assuming that there 
exists an integer p G [1, n — 1] such that every matrix of V splits up as 



N 



A{N) 
[0] 



1 Jpx(n-p) 

B{N) 
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where A(N) and B(N) are respectively p x p and (n — p) x (n — p) matrices; 
Aiy) and B(V) are then trivial spectrum subspaces respectively of M p (K) and 
M n _ p (IK), and by induction we deduce that 

dim V < r\ + ~ P ) +P(n ~p)= (fj ■ 

In the rest of the proof, we assume that V is irreducible. 

Let us come back to the matrix space A4. It now satisfies condition (i), 
for if it did not, then we would have a non-zero vector Y 6 K n for which 
VI £ I", ViV G V, Y T NX = 0, and hence all the elements of V would map 
all the vectors of K n into some fixed linear hyperplane of K n , contradicting 
the assumed irreducibility of V. Therefore, M. satisfies conditions (i), (ii), and 
urkA4 < n. If A4 is semi-primitive, then the conclusion follows as explained 
earlier. 



If we now assume that M is not semi-primitive, then we may then find a 
minimal integer d G [l,n — 1] for which the bases of V and W 1 are chosen so 
that every matrix of A4 has the form 



M = [H(M) 



\mx(n— d)\ ) 



H(M) 



and H(Ai) is a linear subspace of M mi d(I€) with upper rank less than d. In 
this situation, we may also modify the basis of V further to the point where, for 
every M € M, we have 

K(M) 

_[0] (m—c)xd_ 

where K(A4) is a linear subspace of M Cj d(lK.) with upper rank less than d and 
which satisfies condition (ii). Then, using the minimality of d and the fact that 
A4 satisfies condition (i), we can see that K(A4) is a semi-primitive subspace of 
M C!f2 (lK) . Theorem Q] then yields 



c < 



Let us now come back to V and see how the above reduction plays out. As 
we may replace V with a similar subspace, no generality is lost in assuming that 
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'Ri(N) 




'A(N) 


C(N) 


R 2 (N)_ 




B(N) 


D(N)_ 



the chosen basis of K n is the canonical one. Let us then write every matrix iV 
of V as 

N 

where Ri(N), R 2 (N), A(N), B(N), C(N) and D(N) are respectively d x n, 
(n — d) x n, d x d, (n — d) x d, d x (n — d) and (n — d) X (n — <i) matrices. Set 
also 

W := Keri?i. 

As K(A4) satisfies condition (ii), one sees that the last m — c vectors of the 
chosen basis of V span the subspace of all matrices of V in which the first d rows 
equal zero, and hence 

m — c = dim W. 



On the other hand, every matrix N of W splits up as 

N 



[0] [0] 
B{N) D(N) 



and hence D(W) is a trivial spectrum subspace of M n _^(IK). By induction, we 

| , and therefore 

2 



have dimD(W) < 



dim W < (n - d) d + 



n — d 
2 



We conclude that 



m = c + dim W < 



2) +(n " d)(i+ (V 



thus completing the proof of inequality ([T]) . 

If we now assume that m = (^) on °^ P rev i° us assumptions, then 
all the above inequalities turn out to be equalities, and in particular we have: 

(a) dimD(W) =(^~ d 

(b) The space V contains every matrix of the form 

[0}dxd Mdx(n-d) 



\(n—d)xd 



[°](n-d) 



x(n— d) 
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Using point (b), we deduce that, for every N G V, the matrix 



A(N) C{N) 
[0] D(N) 

belongs to V, and hence D(V) is a trivial spectrum subspace of M n _^(K). By 
induction, we have dimD(V) < ( n ~ d ) = dimD(W) with D(W) C D(V), which 
yields D(W) = D{V). 

We shall now obtain a contradiction from an invariance argument. Assume 
that some Nq € V satisfies C(Nq) ^ 0. Then, we may find a non-zero vector 
x € K n ~ d such that C(N ) x ^ 0, and then choose R € M n _ d;d (K) for which 

Id [0] " 



RC(Nq) x = x. With the invertible matrix P := 



R I 



n—d 



one computes that 



ViV € V, PiVP 



-l 



A(iV) - C\N)R 
B(N) + RA(N) - RC(N)R 



C(N) 
D(N) + RC(N) 



In the new trivial spectrum space V' := PVP 1 , the subspace of matrices with all 
first d rows equal to zero is still W. Thus, with the above dimensional arguments 
applied to V, we can deduce that D(N Q ) + R C(N ) belongs to D(V) = D(W) = 
D(V), and hence RC(Nq) belongs to D(V). This is absurd because we have seen 
that D (V) is a trivial spectrum subspace of M n _^(K). 

Therefore, C{N) = for all N € V, and hence {0} x K n " d is a globally 
invariant subspace for all the matrices of V. This contradicts the assumed irre- 
ducibility of V and concludes the proof. 
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